Annotating Similes in Literary Texts
نویسنده
چکیده
Annotated corpora are invaluable resources for researchers in the humanities: on the one hand, for natural processing tasks, they can serve as standards against which results from new automatic methods can be measured; on the other hand, in corpus-based studies, they enable either to answer existing research questions or to explore original ones. In this respect, some annotation frameworks such as the Text Encoding Initiative (TEI) attempt to standardise annotation practices in order to facilitate data reuse and exchange. However, despite the crucial role played by figurative language in general and similes in particular in language, no consensus has been reached so far on how to comprehensively annotate them in literary texts. The present paper proposes a framework for annotating similes in literary texts which takes into consideration their semantic and syntactic characteristics as well as the challenges inherent to the automatic detection of similes.
منابع مشابه
"Pale as death" or "pâle comme la mort": Frozen similes used as literary clichés
The present study is focused on the automatic identification and description of frozen similes in British and French novels written between the 19 century and the beginning of the 20 century. Two main patterns of frozen similes were considered: adjectival ground + simile marker + nominal vehicle (e.g. happy as a lark) and eventuality + simile marker + nominal vehicle (e.g. sleep like a top). Al...
متن کاملThe intellectual Simile in Tarikh-e-Vassaf
Simile, which appears in different modes, is one of the major subjects of a branch of rhetoric called Bayan. Some of these modes, such as intellectual-intellectual and sensible-intellectual are not in harmony with the illuminating role of simile; therefor, the books on Bayanhardlyhave positive attitudes towards these modes of simile, but contrary to this attitude, such similes appea...
متن کاملA Flexible NLP Pipeline for Computational Narratology
Temporal dependencies reveal interesting insights into the semantic discourse structure of narrative texts. The investigations of literary scientists are, as of today, mostly based on labor-intensive manual annotations. Computational Narratology, an important subtopic of the Digital Humanities, aims at facilitating annotations and supporting literary scientists with their analyses. According to...
متن کاملAnnotating Characters in Literary Corpora: A Scheme, the CHARLES Tool, and an Annotated Novel
Characters form the focus of various studies of literary works, including social network analysis, archetype induction, and plot comparison. The recent rise in the computational modelling of literary works has produced a proportional rise in the demand for character-annotated literary corpora. However, automatically identifying characters is an open problem and there is low availability of lite...
متن کاملInvestigating the stylistic relevance of adjective and verb simile markers
Similes are figures of speech in which the similarities as well as the differences between two or more semantically unrelated entities are expressed by means of a linguistic unit. This unit, also called marker, can either be a morpheme, a word or a phrase. Since similes rely on comparison, they occur in several languages of the world. Depending on the marker used and of the semantic or structur...
متن کامل